De-identification of primary care electronic medical records free-text data in Ontario, Canada
نویسندگان
چکیده
BACKGROUND Electronic medical records (EMRs) represent a potentially rich source of health information for research but the free-text in EMRs often contains identifying information. While de-identification tools have been developed for free-text, none have been developed or tested for the full range of primary care EMR data METHODS We used deid open source de-identification software and modified it for an Ontario context for use on primary care EMR data. We developed the modified program on a training set of 1000 free-text records from one group practice and then tested it on two validation sets from a random sample of 700 free-text EMR records from 17 different physicians from 7 different practices in 5 different cities and 500 free-text records from a group practice that was in a different city than the group practice that was used for the training set. We measured the sensitivity/recall, precision, specificity, accuracy and F-measure of the modified tool against manually tagged free-text records to remove patient and physician names, locations, addresses, medical record, health card and telephone numbers. RESULTS We found that the modified training program performed with a sensitivity of 88.3%, specificity of 91.4%, precision of 91.3%, accuracy of 89.9% and F-measure of 0.90. The validations sets had sensitivities of 86.7% and 80.2%, specificities of 91.4% and 87.7%, precisions of 91.1% and 87.4%, accuracies of 89.0% and 83.8% and F-measures of 0.89 and 0.84 for the first and second validation sets respectively. CONCLUSION The deid program can be modified to reasonably accurately de-identify free-text primary care EMR records while preserving clinical content.
منابع مشابه
Trends in the use of electronic medical records.
A comparison between the results of the 2007 and the 2010 National Physician Survey (NPS) shows that exclusive use of electronic medical records (EMRs) by family physicians, general physicians, and other specialists across Canada has increased from 10% to 16%. The province of Alberta leads the way with 28% of physicians exclusively using EMRs, followed by Ontario (20%) and British Columbia (19%...
متن کاملAdoption of Electronic Personal Health Records in Canada: Perceptions of Stakeholders
Background Healthcare stakeholders have a great interest in the adoption and use of electronic personal health records (ePHRs) because of the potential benefits associated with them. Little is known, however, about the level of adoption of ePHRs in Canada and there is limited evidence concerning their benefits and implications for the healthcare system. This study aimed to describe the current ...
متن کاملIdentifying cases of congestive heart failure from administrative data: a validation study using primary care patient records.
INTRODUCTION To determine if using a combination of hospital administrative data and ambulatory care physician billings can accurately identify patients with congestive heart failure (CHF), we tested 9 algorithms for identifying individuals with CHF from administrative data. METHODS The validation cohort against which the 9 algorithms were tested combined data from a random sample of adult pa...
متن کاملStrategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.
BACKGROUND De-identification and anonymization are strategies that are used to remove patient identifiers in electronic health record data. The use of these strategies in multicenter research studies is paramount in importance, given the need to share electronic health record data across multiple environments and institutions while safeguarding patient privacy. METHODS Systematic literature s...
متن کاملImproving Care for the Frail in Nova Scotia: An Implementation Evaluation of a Frailty Portal in Primary Care Practice
Background Understanding and addressing the needs of frail patients has been identified as an important strategy by the Nova Scotia Health Authority (NSHA). Primary care (PC) providers are in a key position to aid in the identification of, and response to frailty as part of routine care. Unlike singular chronic conditions such as diabetes and hypertension which garner a disease-based appr...
متن کامل